Search results for "Data mapping"

showing 4 items of 4 documents

Towards interpretable classifiers with blind signal separation

2012

Blind signal separation (BSS) is a powerful tool to open-up complex signals into component sources that are often interpretable. However, BSS methods are generally unsupervised, therefore the assignment of class membership from the elements of the mixing matrix may be sub-optimal. This paper proposes a three-stage approach using Fisher information metric to define a natural metric for the data, from which a Euclidean approximation can then be used to drive BSS. Results with synthetic data models of real-world high-dimensional data show that the classification accuracy of the method is good for challenging problems, while retaining interpretability.

business.industryPattern recognitionBlind signal separationSynthetic dataData mappingsymbols.namesakeComponent (UML)Metric (mathematics)symbolsArtificial intelligenceFisher informationbusinessFisher information metricInterpretabilityMathematics
researchProduct

Q-nexus: a comprehensive and efficient analysis pipeline designed for ChIP-nexus

2016

Background: ChIP-nexus, an extension of the ChIP-exo protocol, can be used to map the borders of protein-bound DNA sequences at nucleotide resolution, requires less input DNA and enables selective PCR duplicate removal using random barcodes. However, the use of random barcodes requires additional preprocessing of the mapping data, which complicates the computational analysis. To date, only a very limited number of software packages are available for the analysis of ChIP-exo data, which have not yet been systematically tested and compared on ChIP-nexus data. Results: Here, we present a comprehensive software package for ChIP-nexus data that exploits the random barcodes for selective removal …

0301 basic medicineFOS: Computer and information sciencesDuplication ratesChromatin ImmunoprecipitationBioinformaticsPipeline (computing)610Biologycomputer.software_genre600 Technik Medizin angewandte Wissenschaften::610 Medizin und Gesundheit03 medical and health sciencesSoftwareChIP-nexusGeneticsPreprocessorNucleotide MotifsLibrary complexityChIP-exoGeneticsProtocol (science)Binding Sitesbusiness.industryfungiComputational BiologyHigh-Throughput Nucleotide SequencingReproducibility of ResultsChipChromatin immunoprecipitationData mappingDNA-Binding ProteinsAlgorithm030104 developmental biologyChIP-exoData miningbusinessPeak callingcomputerAlgorithmsSoftwareProtein BindingTranscription FactorsResearch ArticleBiotechnologyBMC Genomics
researchProduct

Executable Data Quality Models

2017

The paper discusses an external solution for data quality management in information systems. In contradiction to traditional data quality assurance methods, the proposed approach provides the usage of a domain specific language (DSL) for description data quality models. Data quality models consists of graphical diagrams, which elements contain requirements for data object's values and procedures for data object's analysis. The DSL interpreter makes the data quality model executable therefore ensuring measurement and improving of data quality. The described approach can be applied: (1) to check the completeness, accuracy and consistency of accumulated data; (2) to support data migration in c…

Computer scienceData transformation02 engineering and technologycomputer.software_genreData modeling0203 mechanical engineering0202 electrical engineering electronic engineering information engineeringInformation systemLogical data modelGeneral Environmental ScienceData elementDatabaseInformation qualityData warehouseData mapping020303 mechanical engineering & transportsData modelData qualityGeneral Earth and Planetary Sciences020201 artificial intelligence & image processingData pre-processingData architectureData miningSoftware architecturecomputerData migrationData virtualizationProcedia Computer Science
researchProduct

A proposed mapping method for aligning machine execution data to numerical control code

2019

The visions of the digital thread and smart manufacturing have boosted the potential of relating downstream data to upstream decisions in design. However, to date, the tools and methods to robustly map across the related data representations is significantly lacking. In response, we propose a mapping technique for standard manufacturing data representations. Specifically, we focus on relating controller data from machining tools in the form of MTConnect, an emerging standard that defines the vocabulary and semantics as well as communications protocols for execution data, and G-Code, the most widely used standard for numerical control (NC) instructions. We evaluate the efficacy of our mappin…

0209 industrial biotechnologyVocabulary021103 operations researchComputer sciencemedia_common.quotation_subject0211 other engineering and technologies02 engineering and technologyThread (computing)computer.software_genreData mappingData modeling020901 industrial engineering & automationMachiningMTConnectNumerical controlData miningCommunications protocolcomputermedia_common2019 IEEE 15th International Conference on Automation Science and Engineering (CASE)
researchProduct